Information Structure and Pauses in a Corpus of Spoken Danish
نویسنده
چکیده
This paper describes a study in which a corpus of spoken Danish annotated with focus and topic tags was used to investigate the relation between information structure and pauses. The results show that intra-clausal pauses in the focus domain, tend to precede those words that express the property or semantic type whereby the object in focus is distinguished from other ones in the domain.
منابع مشابه
Annotating Information Structure in a Corpus of Spoken Danish
This paper presents the work done to annotate a corpus of spoken Danish with information structure tags, and describes a preliminary study in which the corpus has been used to investigate the relation between focus and intra-clausal pauses. The study indicates that the pauses that do fall within the focus domain, tend to precede property-expressing words by which the object in focus is distingu...
متن کاملStress, pauses, pronominal types and pronominal functions in Danish spoken data
In this paper we present a study of the relation between types of third personal singular neuter pronoun and their functions in Danish spoken data where stress information is marked so that personal and demonstrative occurrences of the pronouns can be distinguished. This study confirms that there are language specific differences in the way various types of pronoun are used to refer to abstract...
متن کاملThe Prosody of Discourse Structure and Content in the Production of Persian EFL Learners
The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...
متن کاملThe Value Of Minimal Prosodic Information In Spoken Language Corpora
This paper reports on an investigation into representing tone unit boundaries (pauses) as well as words in a corpus of spoken English. An analysis of data from MARSEC (Machine Readable Spoken English Corpus) shows that, for professional speakers, the inclusion of this nfinimal prosodic information will lower the perplexity of a language model. The analysis is based on information theoretic tech...
متن کاملThe Segmentation of Speech
This paper reports a phenomenon supporting the hypothesis that the emergence of structure in the evolution of language was a staged process. To develop a grammatical structure it seems necessary to first have discrete constituents which can be the building blocks of a hierarchical system. By analysing observed speech we show that the development of a linear sequence of grammatical constituents ...
متن کامل